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ABSTRACT 

We discuss the feasibility of measuring the cosmological geometry using the redshift 
space correlation function of the Lya forest in multiple lines of sight, as a function 
of angular and velocity separation. The geometric parameter that is measured is 
f(z) = c _1 H(z)D(z), where H(z) is the Hubble constant and D(z) the angular 
diameter distance at redshift z. The correlation function is computed in linear theory. 
We describe a method to measure it from observations with the Gaussianization 
procedure of Croft et al. (1998) to map the observed Lya forest transmitted flux 
to an approximation of the linear density field. The effect of peculiar velocities on 
the shape of the recovered power spectrum is pointed out. We estimate the error in 
recovering the f{z) factor from observations due to the variance in the Lya absorbers. 
We show that ~ 20 pairs of quasars (separations < 3') are needed to distinguish a flat 
$7o = 1 universe from a universe with = 0.2, £l\ = 0.8. A second parameter that is 
obtained from the correlation function of the Lya forest is f3 ~ Q(z)°' 6 /6 (affecting the 
magnitude of the peculiar velocities), where 6 is a linear theory bias of the Lya forest. 
The statistical error of f(z) can be reduced if b can be determined independently from 
numerical simulations, reducing the number of quasar pairs needed for constraining 
cosmology to approximately six. On small scales, where the correlation function is 
higher, f(z) should be measurable with fewer quasars, but non-linear effects must then 
be taken into account. The anisotropy of the non- linear redshift space correlation 
function as a function of scale should also provide a precise quantitative test of the 
gravitational instability theory of the Lya forest. 



Subject headings: cosmology: theory — intergalactic medium — large-scale structure 
of universe — quasars: absorption lines 



1. Introduction 

One of the methods to measure the parameters of the global cosmological metric of the 
universe is to observe the angular size and the redshift extent of a set of objects, which can be 
assumed to be spherically symmetric and to follow the Hubble expansion on average (Alcock 
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& Paczyriski 1979). More generally, this geometric factor can be measured from a correlation 
function of any set of objects depending on angular separation and redshift difference, by requiring 
that the correlation is isotropic. It has long been known that this measurement at high redshift is 
sensitive primarily to the cosmological constant (Alcock & Paczyhski 1979). However, generally 
the correlation of objects is due to gravitational collapse, and the peculiar velocities make the 
correlation function in redshift space anisotropic (Kaiser 1987). The effect of peculiar velocities 
must be taken into account before the method can be applied. 

Recently, a method has been developed to recover the power spectrum of mass fluctuations 
from quasar absorption spectra, by measuring the one-dimensional power spectrum and converting 
it to the desired three-dimensional power spectrum (Croft et al. 1998, hereafter CWKH). This 
method suffers from peculiar velocity distortions similar to those which distort the isotropy of the 
correlation in redshift space. Here, we consider the accuracy in the measurement of the redshift 
space correlation function from the Lya forest in nearby pairs of quasars. We show that it is 
possible to disentangle the effects of geometry and peculiar velocities, and recover the power 
spectrum of mass fluctuations from the correlations in the Lya forest. We also demonstrate that 
the peculiar velocities are important for correctly deriving the shape of the power spectrum. 

In §2 we summarize equations describing the cosmological geometry, discuss the linear theory 
correlation function in redshift space, and comment on the effect of peculiar velocities on the 
shape of the power spectrum. In §3 we use a random line model to estimate the error in the 
measurement of f{z) from a given number of observed quasar spectra. The discussion is given in 
§4. Figures ^-|| contain the main results of this paper. 

2. Linear Theory of the Lyman Alpha Forest Correlation Function 

2.1. Cosmological Geometry 

The following is a short summary of the cosmological equations we shall use (these have 
been discussed earlier in several papers, e.g., Matsubara & Suto 1996, Ballinger et al. 1997). 
Observations directly measure the redshift and angular position of each object. We write the 
angular and redshift separation between two objects as (Az, A9). The redshift is caused both by 
Hubble flow velocities {v^) and peculiar velocities along the line of sight (v p ). The total velocity 
separation along the line of sight is Avn = cAz/{\ + z) = Avh + Av p . It is also convenient to define 
a perpendicular velocity separation, Av± = cf(z)A9, where f(z) is a dimensionless function of 
redshift that includes all the dependence on the global cosmological metric. With the assumption 
of isotropy, the real space two-point correlation function of density fluctuations ^ r (Avh, Av±) 
must be a function of J Av^ + Avj_ only. If £ r could be measured, it would be a relatively 
straightforward matter to measure f(z) by simply demanding isotropy. In reality, distances cannot 
be measured accurately and only the redshift space correlation function £ can be determined, 
which is affected by peculiar velocities. The peculiar velocities introduce an anisotropy in £ of the 
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same order as the difference in f(z) between various cosmological models. 

The quantity f(z) is predicted for any cosmological model. If the present density of matter 
(in units of the critical density) is Qq, and considering also a negative pressure component with 
density £1\ and equation of state p = wp (the case w = — 1 is the cosmological constant), we have, 
for an open universe, 

_ £(z)sinh [Vn^J z (dz/E(z))] 

and for a flat universe, 

_ E(z)f*(dz/E(z)) 
J[Z) ~ (1 + z) • [Z) 

Here, flu = 1 — f^o — £l\ (Qr = for a flat universe), and 



E(z) = yJOo^l + z)3 + n R (l + Zf + J2 A (1 + z)3(l+to) . (3) 

Figure [l| shows predictions for /(z). 



2.2. The Correlation Function in Redshift Space 

In linear theory, the redshift space correlation function of the density field is given by (Kaiser 
1987; Lilje & Efstathiou 1989; Hamilton 1992; Fisher 1995) 

£(At,|,, At;j_) = (l + ~/3 + ^ & W - + ^/5 2 ) 6(s)P 2 (/u) + (^/3 2 ) ^W^Cm) , (4) 

where s = ^Auy + Auj_, /i = Avy/s, are the usual Legendre polynomials, and 

u2 poo 

600 = ^2 7 dkk 2 P(k)jt [ks(l + z)/F(z)] . (5) 

The functions j;(x) are the usual spherical Bessel functions, and P(k) is the power 
spectrum of the mass fluctuations. The parameter (3 is related to the linear theory bias b 
by (3 = b~ 1 H(z)^ 1 (dD/dt)/D, where D is the linear growth factor, H(z) is the Hubble constant 
and t is the age of the universe [see Peebles 1993; H (z)- 1 (dD / dt) / D ~ n(z) - 6 is a good 
approximation in most models]. 

Equation (||) is valid only in linear theory, and the correlation function £ could only be 
determined directly from observations if the linear density field, 8, were known. Only the fraction 
of the flux that is transmitted, F, can be determined along a line of sight from the Lya forest 
spectrum in a quasar (we assume here that the quasar continuum has been fitted to a model, 
allowing the transmitted flux fraction F to be measured at every pixel). We need a way to recover 
the linear density field 5 from the observed F. In general, this cannot be done exactly since F is 
only known along a line of sight, and besides the chaotic nature of non-linear evolution limits the 




Fig. 1.— f(z) = Av L /{cA6) computed for ft = 1-0, = 0.0 (solid line), Q = 0.4, Q A = 0.6, 
w = — 1 (long dashed line), fio = 0.3, 17 a = 0.0 (short dashed line), and = 0.4, Qa = 0.6, 
w = —2/3 (dotted line). 
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accuracy to which 8 can be recovered. Here, we adopt the Gaussianization procedure of CWKH 
(see also Weinberg 1992). Gaussianization assumes that the initial mass density fluctuations were 
Gaussian random and that evolution approximately preserves the rank order of densities, even as 
it distorts the distribution from a Gaussian. In the case of the Lya forest, values of the observed 
flux decrement are mapped monotonically onto a new variable 5 required to have the probability 
distribution function 
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Of course, the fluctuation amplitude \/£(0) is not recovered by the assumption that the rank order 
is preserved. Computing the correlation function of the Gaussianized spectrum yields only the 
ratio f(Au, A0) = £(Av, A0)/f(O). 

Once the new variable 8 is obtained, the correlation function may be computed from 
observations through the usual estimator given by its definition: ((Av) = < <5(v) <5(v + Av) > , 
where Av symbolizes the vector separation (Auy, Av±). The average is taken over all pairs of 
pixels separated by Av in the spectra available. However, this is not necessarily the best estimator 
of (, and in general it should be better to examine the full 2-point joint probability distribution 
function. One of the main reasons for this is that the assumption that the density rank order is 
preserved should obviously break down at high optical depths, because the gas at high densities 
will typically be shocked and follow a highly stochastic evolution, developing small-scale structure. 
The spectrum will also be greatly affected by velocity caustics and thermal broadening, and when 
t»1 the value of 8 derived from Gaussianization will have large errors because of saturation. 
On the other hand, at low densities the evolution is much more regular, velocity caustics do not 
appear, and thermal broadening can be neglected, so there should be a good correspondence 
between the optical depth at a given velocity and the gas density at the corresponding point in 
space. 

The 2-point distribution function for a Gaussian field is 



P 2 (*5i,<5 2 ,Av) 



This can be rewritten as 
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(8) 



Thus, for every value of Si, we can estimate ((Av) from the distribution of 82 conditional to the 
value of Si. For example, one could estimate ((Av) from the median of 82 (or any other adequate 
percentile), and see how the result depends on 8\. If the method works, ( should be approximately 
independent of 81 for low values of 8\. Using instead ((Av) = <<5(v) <5(v + Av) > is equivalent to 
averaging the estimate of ( over all values of 81 with equal weights. 
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2.3. The Effect of Peculiar Velocities on the Power Spectrum 

In their reconstruction of the linear power spectrum from Lya forest lines, CWKH neglect 
the effect of peculiar velocities on the shape of P(k). This effect arises in the conversion from the 
measured one-dimensional power spectrum P\d to the desired three-dimensional power spectrum 
P"iD- While the shape of P 3 d is not affected by peculiar velocities in linear theory (Kaiser 1987), 
P\b is affected as shown by Kaiser & Peacock (1991): 

PlD(k\\) = — / dkkP 3D (k)(l + PkfJk 2 ) 2 , (9) 

Jkn 

To demonstrate how this will affect the reconstruction of CWKH, we substitute a specific 
PzD{k) into equation (^), with various values of f5. We use the cold dark matter power 
spectrum parameterization of Bardeen et al. (1986), with the coefficients of Ma (1996) (see §3.1 
below), and the parameters: Q = 1, h = 0.5, n = 1.0. We then assume that the resulting 
P\D is the one-dimensional power spectrum for the Lya forest and attempt to reconstruct the 
three-dimensional power spectrum ignoring peculiar velocities (i.e., setting j3 = 0). This yields a 
new function P 3 £,{k), different from the correct P 3 o{k) because of the effect of peculiar velocities, 
which is given by 

POO 

P 3D (k) = P 3D {k){l + f3) 2 - dk' 

Jk 

The second term causes the change in the shape of P 3 £>(k) relative to P 3 £>(k). Figure ^ shows the 
results of this procedure for various values of j3. The error in the reconstructed power spectrum 
grows with scale and eventually causes the results to become negative. The power spectrum used 
in this figure was smoothed by a Gaussian with radius r s = 0.24/i _1 MpcQ. CWKH found that this 
was necessary to match the P(k) they reconstructed from simulations. 

Thus, the shape of the mass power spectrum P 3 £,{k) cannot be recovered until the parameter 
f3 is known. How can this parameter be determined? One could in principle attempt to determine 
(3 from observations in multiple lines of sight, as described in the next section. The anisotropy 
of the correlation function depends on the two parameters (5 and f(z). But as we shall see, a 
very large amount of data will be required to measure both of them independently. If (3 can be 
predicted from theory, it should be much less difficult to determine the power spectrum and f(z). 

Linear theory would predict 6 = 2 for a constant temperature in the intergalactic medium, 
because the optical depth (which is the quantity that is modified by peculiar velocities in the 
mapping from real to redshift space) is proportional to the neutral hydrogen density, or the 
square of the gas density. In a photoionized medium, the gas temperature is determined by a 



fPsn(k')(l+^. 



(10) 



1 The value of r s quoted in CWKH (in the caption of their Fig. 2) was wrong by a factor 2tt; Croft 1998, priv. 
communication. 
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Fig. 2. — (a) The power spectrum as it would appear if reconstructed from Pm(k) while ignoring 
peculiar velocity effects. From top to bottom, (3 = 0.0,0.2,0.4,0.6,0.8, 1.0. The (3 = line is the 
correct power spectrum. The normalization for all curves is fixed at large k, and a Gaussian cutoff 
has been applied, (b) The ratio of the reconstructed to the true power spectra. 



-8- 



balance of photoionization heating and adiabatic cooling. The heating rate is proportional to the 
recombination rate, ctp, and the adiabatic cooling is proportional to the temperature T. Since 
a oc T~ 0,7 , the relation T oc /? ' 6 is set up if the gas temperature is not affected by shocks or by 
reionization (see Hui & Gnedin 1997, Croft et al. 1997). This leads to a neutral hydrogen density 
proportional to ap 2 oc p 1-6 , and therefore a Lya forest bias b = 1.6. Since Q(z) ~ 1 at z ~ 3 for 
viable models, [3 ~ 0.6. However, in reality the bias depends on the relation of optical depth to 
the initial density on small scales, where non-linearities are important, and therefore the correct 
bias to use in equations (6), (9), and (10), where linear theory is applied to large scales using the 
Gaussianization approximation, could be substantially different. Numerical simulations can be 
used to calculate a better value for the Lya forest bias (which will in general depend on redshift), 
but of course this will only be accurate if the simulations are modeling the structure of the Lya 
forest correctly. 

The effects of peculiar velocities on the recovery of the mass power spectrum have 
independently been pointed out in a recent paper by Hui (1998), which appeared as this paper 
was being completed. 



3. Errors in the Measurement of the Lya Forest Correlation 

In this section, an estimate is obtained of the statistical error in measuring the parameters 
of the correlation function, due to the random nature of the absorption lines that appear in 
the spectra. A simple model of absorption lines will be used to generate random spectra that 
reproduce the characteristics of the individual Lya forest absorption lines without the large scale 
correlation. 



3.1. Parameterization of the Correlation Function 

As discussed in §2, two parameters describe the anisotropy of the correlation function: f(z), 
reflecting the effect of the cosmological geometry, and (3(z), incorporating the peculiar velocity 
effects. We also parameterize the shape of the power spectrum to a fitting formula for cold dark 
matter models, 

p( ks ) = KiMi +aiq )/ ai e ^ (n) 

[1 + a 2 q + {a 3 q) 2 + (a 4 q) 3 + {a 5 q) 4 } 2 

where q = k s /T(z). We have reexpressed the power spectrum in terms of k s = k(l + z)/H(z). 
The free parameters of this model for the shape of the power spectrum are n and T(z) (given by 
T(z) = (1 + z)Q h 2 1 H (z)) . The formula is given by Bardeen et al. (1986), but we modify the 
parameters to the fit for tlb = 0.05: a± = 2.205, a 2 = 4.05, 03 = 18.3, 04 = 8.725, and 05 = 8.0 
(Ma 1996). In addition we include a smoothing parameter by multiplying the above formula by 
the factor exp(— k 2 v 2 /2), with v s = r s H(z)/(l + z) = 48kms _1 at z = 3. This is motivated by the 
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result of CWKH, who find that the Lya forest has a power spectrum (after Gaussianization) that 
can be well approximated by a smoothed version of the power spectrum of the mass. We fix v s 
(rather than r s ) to the same value for all models, since the observations always yield separations 
in terms of velocity. 

In Figure ||, we display the computed £(Av,A0) along the line of sight (A9 = 0), and 
at separations AO = 127" and AO = 300", at mean redshift < z >= 2.25. Because of the 
Gaussianization, we measure £(Au,A#) = £(Au, A0)/£(O). The model parameters are f2o = 1-0, 

= 0.0, h = 0.65, n = 1.0, and (3 = 0.6. These parameters correspond to f(z) = 1.61, and 
T(z) = 0.0036 (km/s) -1 . Also shown in the fi gure are the predictions for a model with f(z) = 1.39 
(the value for S7o = 0.4, $7 a = 0.6), but with T, n, (3 and v s unchanged. 

3.2. Analysis of Random Spectra 

In this subsection we use a random line model to estimate the noise in the measurement 
of ^(Av,A9). We create Lya forest spectra using a code which produces lines by randomly 
distributing Voigt profiles with a specified distribution of column densities N and widths b. 
Figure |] shows a piece of one of these spectra. We take a set of parameters that closely match 
the distribution in the observations (e.g., Kim et al. 1997). We will consider the mean redshift 
< z >= 2.25. We set the number of lines per unit redshift with column density greater than 
10 14 cm~ 2 to be N >14 = 50, and set f(N) oc iV" 1 - 35 with a break to iV" 1 - 7 at log(iV) = 14.3. We 
set the mean b-parameter of the Voigt profiles to be < b >= 30kms _1 with a Gaussian dispersion 
of aj, = 12kms _1 and a lower cutoff of b cu t = 24kms~ 1 . 

A spectrum is generated by calculating the transmitted flux at discrete pixels from the list 
of randomly generated lines. We then make a transformation of the transmitted flux to a new 
variable 5, requiring that the probability distribution of 5 is Gaussian (this is the Gaussianization 
procedure). Because the absorption lines are random, there should be no correlation at separations 
beyond the width of the individual components, so any correlation we measure at larger separations 
is due to noise. Figure [5] shows the resulting £(Av, 0) for a pair of lines. The self-correlation of 
the individual components due to their own width extends out to Av ~ 170 km s -1 . The rms 
fluctuation around the mean £ is ~ 0.03. The errors in £ are obviously also correlated over 
~ 100 km s -1 due to the width of the lines. 

3.3. Estimation of Errors 

We now estimate the errors in measuring parameters in the correlation function from a given 
set of quasar spectra. We consider first as an example the triplet of quasars of Crotts & Fang 
(1998), which have a useful redshift range z ~ 2.0 — 2.5 and separations AO = 127" , 147", and 
177". The error bars we have computed for £ are not necessarily Gaussian, and certainly not 
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Fig. 3. — The solid and long dashed lines are predictions for a flat model with 0,q = 1.0, h = 0.65, 
and (3 = 0.6 (giving f(z) = 1.61). The solid line is the correlation along the line of sight (normalized 
to £(0>0) = 1 because of Gaussianization). The upper dashed lines are for A6 = 127", and the 
lower dashed lines are for A6 = 300". The short dashed lines were produced by changing f(z) to 
1.39, appropriate for a flat f^A = 0.6 model, without changing the power spectrum parameters or (3 
(the line of sight correlation is the same in each case). 
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Fig. 4.— 



Flux vs. velocity for a randomly generated spectrum. 
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Fig. 5. — The correlation function along the line of sight for a pair of randomly generated spectra 
(Az = 0.5, < z >= 2.25 for each). 
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independent, so we use a Monte Carlo technique. We introduce noise to the calculated correlation 
function of a model by adding the noise values of the correlation measured from spectra of 
randomly generated lines (i.e., we sum the curves in Fig. 3 and 5). We then fit the five parameters 
of the model, minimizing the x 2 °f the correlation function in linear bins of ~ 45kms _1 , using 
the error bars found from the dispersion in the correlation obtained from the spectra of random 
lines. We repeat this procedure independently 25 times and fit each of the 25 simulated correlation 
functions separately. This should give the range of the fitted values we would expect to obtain from 
data with some fixed true parameter values. We do not use the correlation function at separations 
less than 300 km s -1 , approximately the scale of non-linearities. Changing this restriction to 
200 km s -1 results in a small improvement (particularly, tighter correlation between / and 0), but 
does not substantially change our conclusions. Figure |6] shows the results for the f(z) = 1.61, 
(3 = 0.6 model, fit to the triplet of quasars described above. 

To reduce the scatter, it is necessary to add more quasars. Figure [?] shows the improvement 
that can be expected by combining multiple pairs of quasars at different angular separations (but 
still at the same redshift). Six pairs with separations A9 = 45", 75", 105", 135", 165" and 195", 
with useful redshift range z = 2.0 — 2.5, are combined. In Figure ||, we use the same 6 angular 
separations but reduce the noise in the cross-correlation by a factor of two, and reduce the noise 
in the LOS correlation by a factor of six. This should approximately represent 24 pairs of quasars, 
with an additional 384 single quasars included to improve the measurement of the line of sight 
correlation. An independent determination of (3 can also improve the measurement of /. In Figure 
H we plot the distribution of measured / for f3 = 0.2, (3 = 0.6, and (3 = 1.0. For these fits, the 
value of (3 is fixed at the correct value. The sets of quasars used are the 6 pairs of Figure [j]. 
Finally, with fixed /?, we found that / can be measured to ~ ±10% (la) using the reduced noise 
(24 quasar pairs) of Figure ||. 

4. Discussion 

We have presented a method for analyzing the Lya forest spectra along parallel lines of 
sight in order to extract the geometric parameter f(z) and the quantity determining the strength 
of peculiar velocity distortions j3(z) ~ f2°' 6 /6. We have worked here in the context of the 
linear regime, where the correlation function is small and the peculiar velocities generally cause 
large-scale structures (with positive or negative density fluctuations) to be flattened along the 
line of sight. Using the Gaussianization technique of Croft et al. (1998), one can measure the full 
redshift space correlation function and obtain (3 and f(z). Using a model in which we distribute a 
realistic set of discrete lines randomly in space we have estimated the statistical errors expected 
for a measurement of the correlation function. We estimate that dozens of pairs of quasars are 
needed to reliably distinguish competing cosmological models. 

The parameter / should probably be easier to measure going to smaller separations, in the 
non-linear regime. The amplitude of the correlation is then much larger, so many fewer pairs of 
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Fig. 6. — Best fit values for f3{z) and f(z) for 25 sets of random spectra. The sets of quasar spectra 
fit were random realizations of a triplet with angular separations 127", 147", and 177", with a 
useful redshift range z = 2.0 — 2.5. The true values were f3(z) = 0.6 and f(z) = 1.61. f(z) = 1.61 
corresponds to Qq = 1.0 and = 0.0 at redshift < z >= 2.25. Two points with high j3 do not 
appear on the plot. The constraint (3 > was applied to the fits. 
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Fig. 7. — Best fit values for 25 realizations of a combination of 6 pairs of quasars separated by 
A6 = 45", 75", 105", 135", 165", and 195". The true parameter values were 0{z) = 0.6 and 
f(z) = 1.6 (f\) = 1.0, < z >= 2.25). The constraint (3 > was applied to the fits. 
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Fig. 8. — This figure shows the same 25 fits as Fig. [7], except the added noise in the cross-correlation 
has been reduced by a factor of 2, and the added noise in the line of sight correlation has been 
reduced by a factor of 6. 
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Fig. 9. — Fits assuming (3 is known {(5 is held fixed at the correct value). The error bars represent 
approximately 92% confidence. The 25 sets of quasars fit each contained 6 pairs as in Fig. |. The 
true value of / is / = 1.6. The error bars exclude one point at each extreme. We show (5 = 0.2, 
P = 0.6, and (3 = 1.0. 



-18- 



quasars should be needed to measure the anisotropy of the correlation function to a fixed relative 
accuracy, required to constrain / to interesting levels. For small separations, the effect of peculiar 
velocities should be opposite to that in the linear regime: the correlation function should be 
elongated along the line of sight. This is caused by the well-known "fingers of God" effect in 
galaxy redshift surveys (where high density clusters appear as highly elongated filaments pointing 
toward us). In the Lya forest, the "fingers of God" effect is simply the contribution of the internal 
velocity dispersion (either hydrodynamic or thermal) of the absorbers to their width. 

The disadvantage of working in the non-linear regime is that the anisotropy of the correlation 
function can only be predicted with numerical simulations; in addition, the 2-point joint probability 
function is no longer given by equation (9), because the density field is no longer Gaussian. 
Therefore, we can only measure / if we can be certain that numerical simulations are accurate 
enough. But a more important reason to measure the 2-point function in the non-linear regime is 
probably that it will provide a very powerful test of the results of numerical simulations, and the 
large-scale structure theories in which they are based. If the Lya forest arises from individual, 
high-density clouds, the fingers of God should be much more elongated and common than in 
the modern theories of gravitational instability from primordial fluctuations. Essentially, the full 
redshift space 2-point function gives a more quantitative version of the measurement of transverse 
sizes of the absorbers from coincidences of lines (Crotts &; Fang 1998 and references therein). 

On the large scales, an alternative to finding a large number of pairs of bright quasars to 
measure the anisotropy of the linear correlation function to high accuracy could be to work 
with spectra of much fainter, but much more numerous sources. So far, observational studies of 
the Lya forest have used bright quasars as sources because of the desired high resolution and 
signal-to-noise that is necessary to measure the properties of individual absorption lines. But the 
correlation function could in principle be obtained from spectra of much poorer quality, as long as 
a very large number of spectra are taken. Recently, large numbers of galaxies are being identified 
at high redshift with the method of the Lyman limit break technique (Steidel et al. 1996). Spectra 
are now being taken routinely of galaxies in the redshift range 2.5 to 4, which have a number 
density of ~ one galaxy per square arc minute (Steidel et al. 1998). If the stellar continuum of the 
galaxies can be modeled, the Lya forest spectra in these galaxies could provide a better way to 
measure the Lya forest correlation on large scales (in this case, Gaussianization could not be used 
because the transmitted flux is measured only with very poor resolution, so numerical simulations 
would be necessary to predict the correlation function of the smoothed transmitted flux). These 
observations could also be used to study the cross-correlation of the Lya forest with the Lyman 
limit break galaxies. 

Future work is needed along two paths: analysis of numerical simulations and observational 
data. CWKH showed that the Gaussianization method works fairly well when applied to numerical 
simulations. Direct application of the full method to recover the redshift space correlation function 
to numerical simulations should indicate the effect of non-linearities and give a more accurate 
estimate of the accuracy that can be achieved in the observational determinations. The effect of 
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the finite size of the numerical simulations should now also be more easily understood, since the 
analytic method used in this paper predicts how the correlation should behave on the large scales. 
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